Folding of small proteins: A matter of geometry? 
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I. INTRODUCTION 

During the 1960s the Nobel Laureate Christian An- 
finsen showed through in vitro experiments that globu- 
lar proteins are capable of spontaneously self-assemble 
into their complex, three-dimensional native (i.e., biolog- 
ically active) structures through the process of protein 
folding 1]. This finding suggested that the only infor- 
mation required for a given protein sequence to fold into 
its unique native state is the sequence itself, an idea that 
gave rise to the so-called "protein folding" problem: The 
prediction of the native fold from the knowledge of the 
protein sequence. This question which, in more funda- 
mental terms, concerns the understanding of the physical 
principles involved in the mechanisms of folding and how 
the latter, together with the observed folding rates, are 
encoded in the protein's amino acid sequence has domi- 
nated the folding literature up to the late 1990s. 

In the late 1960s Cyrus Levinthal raised a problem re- 
lated to Anfinsen's observation that the native state is 
the global minimimum of the free energy 0]. By us- 
ing a simple counting argument, Levinthal quantified the 
magnitude of the folding process and concluded that pro- 
teins would not fold in a reasonable amount of time if the 
search for the native conformation is performed randomly 
as may be inferred from Anfinsen's observation, an idea 
that became known as the Levinthal paradox. 

An important contribution to protein folding research, 
that proposes a new scenario for folding, is that of the 
energy landscape theory (ELT), put forward by Bryn- 
gelson and Wolynes in the late 1980s 0|. These au- 
thors recognised that akin to spin glasses proteins ex- 
hibit frustration. Indeed, two kinds of frustration may 
be distinguished in proteins: energetic frustration, owing 
to unfavourable interactions present in the native state, 
and geometric frustration that leads to energy barriers 
between structurally related conformations and results 
from geometric constraints (e.g., chain connectivity). A 
possible scenario is an energy landscape (i.e., the free 
energy as a function of one or more conformation coordi- 
nates) similar to that of random hctcropolymers (RHP), 
with many local minima separated by energy barriers and 
where high energy barriers are likely to produce kinetic 



traps, that is, long-lived low free energy conformations. 
However, analytical studies of RHPs together with Monte 
Carlo simulations provided evidence that, by contrast 
with typical RHPs, the energy landscapes of kinetically 
foldable lattice proteins are smooth, with a large free 
energy gap separating the native state from misfoldcd 
conformations 4], and its overall slope is such that the 
protein is easily driven down to the native state (i.e., it 
must be funnel shaped) [5j- Lattice proteins are models 
that reduce the protein backbone to a string of single 
site beads. While it is true that such minimalist models 
do not capture the full complexity of real proteins they 
are non-trivial models that describe some potentially rel- 
evant aspects of protein folding kinetics. See for a 
recent review on computational methods in protein fold- 
ing that discusses the strengths and limitations of lattice 
models. 

In addition to providing deep new insights into the 
protein folding process the 'energy landscape perspec- 
tive' has stimulated an invaluable synergy between the- 
oretical and experimental research in the field of protein 
folding. In particular, it allowed for "new interpreta- 
tions of existing experiments and has led to the design 
of new strategies to probe the details of the folding pro- 
cess" 10] . A clear example of successful theoretical pre- 
diction that shaped modern thinking about protein fold- 
ing kinetics is that of the concept of nucleation-growth 
(or -condensation) mechanism. The latter was firstly dis- 
covered by Shakhnovich and his collaborators in the con- 
text of a lattice model and Monte Carlo folding simula- 
tions and confirmed by Fersht using a protein en- 
geneering method termed </>- value analysis 9]. Further- 
more, the above mentioned criteria for kinetically fold- 
able lattice-polymers have been tested for real proteins. 
The experimental results suggest that, in spite of their 
success in predicting biologically relevant time frames for 
folding, smooth energy landscapes and large energy gaps 
do not account for the remarkable six order of magnitude 
range characteristic of the folding rates of real two-state 
folders ^l|. By contrast, a parameter of native geom- 
etry, that measures the average sequence separation of 
contacting residue pairs in the native fold, named con- 
tact order (CO), appears to be strongly coupled to the 
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kinetics of many small (with w up to 120 amino acids) 
protein molecules exhibiting two-state folding kinetics. 
Indeed, Plaxco et al. 0, E| found a strong correlation 
(r = 0.92) between the CO and the folding rates of 24 
single-domain, two-state folders. More recently, other 
measures of the native geometry were proposed and were 
found to correlate as well as the CO with real two-state 
folding kinetics 0, 0], supporting the idea that the 
physics underlying the folding mechanism of these fold- 
ers, although 'encoded' in the protein's primary sequence, 
may not depend strongly on its finer details. These obser- 
vations triggered a renewed interest in the effects of the 
native state's geometrical properties on protein folding, 
an issue originally addressed by Go and Taketomi in a 
pioneering study based on a two-dimensional protein lat- 
tice model where the relative role of local and non-local 
contacts in the energetics and kinetics of folding ^(| were 
investigated. 

In this paper we review some recent results obtained 
within the scope of lattice models and Monte Carlo sim- 
ulations that address the role of native geometry in de- 
termining the folding kinetics of small lattice proteins. 

II. LATTICE MODELS AND MONTE CARLO 
SIMULATIONS 



The Shakhnovich model 

By contrast with the Go model, which ignores the pro- 
tein's chemical composition, the Shakhnovich model 01 
addresses the dependence of protein folding dynamics on 
the amino acid sequence by considering interactions be- 
tween the 20 different amino acids used by Nature in 
the synthesis of real proteins. Accordingly, the contact 
Hamiltonian that defines the energy of each conformation 
is such that 

N 

H(Wi}, in}) = £ c(ffi, Oi)A(r! - rj), (2) 

i>3 

where {<Ji\ represents an amino acid sequence, and Ui 
stands for the chemical identity of bead i. In this case 
both native and non-native contacts contribute energeti- 
cally to the folding process. Therefore, the contact func- 
tion A is 1 if any two beads i and j are in contact but not 
covalently linked and is otherwise. The interaction pa- 
rameters e are taken from the 20 x 20 Miyazawa- Jernigan 
(MJ), derived from the distribution of contacts of real na- 
tive proteins 



The lattice-polymer model discretizes space by embed- 
ding the protein in a regular three-dimensional lattice. 
The protein is reduced to its backbone structure: amino 
acids are represented by beads of uniform size, occupying 
the lattice vertices and the peptide bond, that covalently 
connects amino acids along the polypeptide chain, is rep- 
resented by sticks, with uniform length, corresponding to 
the lattice spacing ( Figure^. In order to satisfy the ex- 
cluded volume constraint only one bead is allowed per 
lattice site. 

The Go model 

In the Go model the energy of a conformation, defined 
by the set of bead coordinates {fl}, is given by the con- 
tact Hamiltonian 

N 

H({ri})=J2 eA ( f ' i -f j ), (1) 

where the contact function A(r| — fj), is unity only if 
beads i and j form a non-covalent native contact, i.e., 
a contact between a pair of beads that is present in the 
native structure, and is zero otherwise. The Go potential 
is based on the idea that the native fold is very well 
optimised energetically. Accordingly, it ascribes equal 
stabilizing energies (e < 0) to all the native contacts and 
neutral energies (e = 0) to all non-native contacts. 



Simulation details 

To mimic protein motion in simple lattice models, a 
Monte Carlo (MC) algorithm is used together with the 
kink-jump move set. This means that local random dis- 
placements of one or two beads are repeatedly acce pted 
or rejected in accordance with the Metropolis rule |19j . 
A MC run starts from a randomly generated unfolded 
conformation (typically with very few native contacts) 
and the folding dynamics is traced by following the evo- 
lution of the fraction of native contacts, Q = q/Q ma x, 
where Q m ax is the total number of native contacts for 
each chain length, and q is the number of native contacts 
at each MC step. Kinetic quantities such as the folding 
time, t, is taken as the first passage time (FPT), that is, 
the number of MC steps that corresponds to Q = 1.0. 
All native structures considered are maximally compact 
cuboids found by homopolymer relaxation [36j. 

The folding dynamics is studied at the so-called opti- 
mal folding temperature, the temperature that minimizes 
the folding time as measured by the mean FPT [U I22I 

mm. 

The sequences studied within the context of the 
Shakhnovich model were prepared using the design 
method developed by Shakhnovich and Gutin (SG) [2fJ 
based on random heteropolymer theory and simulated 
annealing techniques. 
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FIG. 1: A maximally compact conformation (MCC), repre- 
senting the native structure, for a 48 bead long chain in a 
cubic lattice. MCCs, with a maximal number of contacts 
between amino acids for each chain length, mimic the high 
degree of compactness that characterises real protein native 
structures. 
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FIG. 2: Dependence of the folding probability, Pfoid, on 
log e (t). Pfoid was computed as the fraction of folding simula- 
tions that ended up to time t normalized to the total number 
of MC runs performed for each chain length. For TV up to 64 
the curves are consistent with asymptotic values of Pfoid — * 1. 
For TV > 80 this dynamical behaviour breaks down and the 
asymptotic value of Pfoid decreases considerably |24|. 



III. CHAIN LENGTH AS A DETERMINANT OF 
FOLDING KINETICS 

There is empirical evidence that the protein's chain 
length, N, grossly determines the kinetics of folding ac- 
cording to the following rule: for TV up to w 100 amino 
acids the kinetics is two-state, while for larger TV pro- 
tein chains typically exhibit non-two-state kinetic be- 
haviour [25| (there are of course exceptions to this rule 
and it is possible to find proteins with chain lenght 
TV < 110 that fold via a three-state kinetics H^). When 
the kinetics is two-state, folding proceeds in the absence 
of any observable intermediates and there is a single tran- 
sition state associated with one major free energy bar- 
rier separating the native from the unfolded conforma- 
tions [2q . 

What do lattice models tell us about the role of chain 
length in theprotein folding kinetics? Results obtained 
by one of us |2J] in the context of the Shakhnovich lattice- 
polymer model showed that, depending on the chain 
length, two dynamical regimes for folding can be identi- 
fied (Figure 0). In the regime found for TV > 80 the fold- 
ing performance depends on the native state's structure, 
with certain structures being kinetically more accessible 
(i.e., more easily foldable) than others. This observation 
is in agreement with results reported in Ref. |2t| where 
it is found that, for 125 bead long proteinlike sequences, 



efficient folding depends on structural features of the na- 
tive state (e.g., the distribution and position of contacts 
in the native structure). While it is likely that kinetic 
traps dominate folding in this regime (akin to what hap- 
pens in the folding of real longer proteins) we have not 
conducted our MC simulations long enough to conclude 
that this is actually the case. 

For TV < 80, as in real two-state proteins, we have 
found that kinetic relaxation is well described by a sin- 
gle exponential law (a distinguishing feature of a two- 
state kinetic process) with the reactant concentration 
(the equivalent in our simulations to the fraction of un- 
folded chains) being proportional to exp _fc * where k is 
the so-called relaxation rate constant (Fig. [3j . 

As for the dependence of folding time on the chain 
length we have found a scaling law of the type t ~ TV 5 
while, for the same model, Gutin et al. \2l\ have reported 
that t and N scale as t ~ TV 4 . For the Go model, on the 
other hand, a weaker dependence of t ~ TV 3 has been 
observed Hi H3- These findings are in broad agree- 
ment with Thirumalai's theoretical prediction that, near 
the point of thermodynamic equilibrium between the un- 
folded coil and the native fold, i.e., at the folding transi- 
tion temperature, the folding time and protein size scale 
as t ~ TV A with A between 3.8 and 4.2 |28|. For real pro- 
teins, however, it has been shown that the dependence 
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FIG. 3: Evidence for single exponential kinetic relaxation for 
N = 48 and N = 64 but not for N = 100. The correlation 
coefficient between the logarithmic fraction of unfolded chains 
(i.e., the 'reactant concentration') and the folding time t is 
r = 0.99 for N = 48 and N = 64 Q. 



of the folding time on protein chain length is weak (r = 
0.16) ^3 1 an d recent results by Galzitskaya et al [2j| sug- 
gest that protein size is the main determinant of folding 
rates for multi-state proteins only. We note that lattice 
simulations use single or double residue MC move sets 
which may differ very significantly from the collective 
motions of large portions of the chain that can occur in 
continuum space. Thus the dependence of the folding 
times of lattice polymers on the chain length could differ 
significantly from that observed in real proteins. This 
shortcoming of the model may be particularly relevant 
for large protein molecules. 



IV. NATIVE GEOMETRY AS A DETERMINANT 
OF FOLDING RATES 

In 1998 Plaxco et al. 0] proposed the relative con- 
tact order parameter, CO, as a simple empirical measure 
of the native structure's geometric complexity. The CO 
measures the average sequence separation of contacting 
residue pairs in the native structure relative to the chain 
length of the protein and is defined as 



CO 
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where Ajj- = 1 if residues i and j are in contact and is 
otherwise; N is the total number of contacts and L is the 
protein chain length. In a subsequent study Plaxco and 



co-workers reported a rather strong correlation (r = 0.92) 
between the CO parameter and the logarithmic folding 
rates of 24 single domain, two-state folders 0] , suggest- 
ing that the native's state geometry could be the major 
determinant of two-state folding kinetics. 

Do lattice models reproduce the experimentally ob- 
served correlation between folding rates and the CO? 
We have recently addressed this question in the context 
of the Shakhnovich model. A set of 20 target native 
structures, selected in order to cover the range of CO ob- 
served in real proteins, was investigated for chain lengths 
N = 36, 48, 54, 64 and 80. A correlation of r = 0.70-0.79 
was found between increasing CO and longer logarithmic 
folding times for chain lengths N > 54 [3J|. There is a 
potential shortcoming of using the MJ potential to in- 
vestigate contact order correlations with folding rates in 
lattice models. Indeed, it was shown recently that the 
energy landscapes of real two-state folders are relatively 
smooth [37], [33 while the folding dynamics associated 
with the MJ potential is prone to energetic traps (that re- 
sult from competing interactions between pairs of beads) 
that may lead to considerably rough energy landscapes. 
Moreover, if the geometry-related kinetics of real two- 
state folders is favoured by their smooth energy land- 
scapes and if the relevant topological effects are subtle 
they could be masked by stronger kinetic effects ari sing 
in sequence-specific models such as the Shakhnovich [3Jj . 
Following this line of reasoning Jewett et al. [3{J studied 
the correlation of the folding time on the contact order 
parameter in the context of the Go model. Since the 
Go potential considers only attractive interactions be- 
tween native contacts the corresponding folding dynam- 
ics is prone to geometric traps only, i.e., traps that result 
from chain connectivity and the geometry of the native 
fold. Thus, as real two-state proteins, lattice-polymers 
modelled by Go and Go-type interaction schemes exhibit 
relatively smooth energy landscapes making them partic- 
ularly suitable models to investigate the role of the native 
state's geometry in the folding kinetics. Notwithstand- 
ing, Jewett et al. found a very poor correlation, r « 0.23, 
between logarithmic folding times and the CO in a pool 
of targets comprising 97 different native structures with 
chain length iV = 27. A modified version of the original 
Go model, characterized by a non-linear relationship be- 
tween the native state's energy and the number of native 
contacts formed during folding, i.e., that enhances fold- 
ing cooperativity, exhibits a stronger correlation (r=0.75) 
between the logarithmic folding time and the contact or- 
der [3^. Although this correlation is weaker than the 
correlation observed experimentally (and lower than that 
found by us for the Shakhnovich model) this finding sug- 
gests that the physical mechanism underlying geometry- 
dependent kinetics may be that of cooperativity. 

In a recent study we have identified different folding 
mechanisms, distinguished by different cooperativities, 
for the Shakhnovich model 401. For low contact order 
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structures we observed that the building up of the native 
fold may occur in a gradual manner, where the amount 
of native structure increases in a continuous way as time 
evolves, or in a more abrupt (or cooperative) way where a 
significant portion of native structure emerges only dur- 
ing the late stages of folding. By contrast, the folding of 
intermediate and high contact order structures is clearly 
more cooperative, in the sense described above. Indeed, 
in the latter case, it is possible to identify a clear pat- 
tern in the formation of the native fold that is driven 
by the backbone distance: local contacts (i.e. close in 
space and in sequence) form first and long-range contacts 
(i.e. close in space but distant in sequence) form progres- 
sively later as contact range increases. As a consequence 
a monotonic decrease of contact frequency with increas- 
ing contact range is observed and this trend is specific 
of the folding dynamics of low contact order structures. 
These results provide a possible explanation of the higher 
correlation found in Ref. between contact order and 
logarithmic folding times for chain length N > 54 and 
predominantly high-CO values. If cooperativity is the 
essential ingredient of geometry-dependent kinetics, as 
the results on modified Go models suggest, and if the 
long-range (LR) contacts enhance the cooperativity of 
the folding transition it is natural to expect a stronger 
correlation in high contact order structures, which have 
predominantly LR contacts. 



V. THE ROLE OF LOCAL AND LONG-RANGE 
CONTACTS: REVISITING THE GO MODEL 

Contact order is a way to quantify a protein's geometry 
that accounts for the average range of amino acid inter- 
actions in the native fold. It is interesting to note that 
in one of the very first studies that made use of a lattice- 
polymer framework to model folding, Go and Taketomi 
investigated the role played by local and long-range (LR) 
inter-residue interactions in the dynamics of this com- 
plex biological process 0. The latter has actually be- 
came one of the most widely-debated issues in the protein 
folding literature. As a result of this debate there is now 
agreement on the fact that LR interactions play an im- 
portant role in stabilizing the native fold Ho> EH E3 
but there is no consensus on their role in the folding ki- 
netics. For example, early results of Go and Taketomi 
|l6j | for a 49-residue chain on a two-dimensional square 
lattice suggest that local interactions accelerate both the 
folding and unfolding transitions. In Ref. js^ Unger and 
Moult have studied optimised heteropolymer sequences 
with chain length N — 27 on a three-dimensional cu- 
bic lattice and concluded that increasing the strength 
of local interactions increases the ability of sequences to 
fold. By contrast, results obtained by Abkevich et oi.|3fT| 
for the Shakhnovich lattice-polymer model provided evi- 
dence that, under conditions where the native state is sta- 



ble, a 36-residue sequence on a three-dimensional cubic 
lattice folds to a native structure with mostly LR contacts 
two-orders of magnitude faster than a sequence folding 
to a native structure with predominantly local contacts. 
In Ref. 13 Govindarajan and Goldstein have used a 
lattice model in conjunction with techniques drawn from 
the theory of spin glasses and found that optimal con- 
ditions for folding are achieved when local interactions 
contribute little to the native state's energy. 

The finding that the CO is correlated with the fold- 
ing kinetics of small, two-state proteins has set a new 
ground for investigating the role of local (and LR) con- 
tacts in protein folding, one where the effects of native 
geometry are taken into account. Moreover, it is natural 
to address this issue in the context of the Go model since 
lattice polymers modelled by the Go potential exhibit 
smooth free energy landscapes where the effects of native 
geometry dominate. We have revisited the Go model in 
the light of these findings to investigate the role of local 
(and LR) inter-residue interactions in the dynamics of 
folding |35j. We introduced a parameter a that weights 
the relative contributions of local and long-range inter- 
actions to the total energy of the native state. When 
a = all LR native interactions are 'switched-off' and 
only local interactions contribute to the total energy of a 
conformation. The opposite situation is observed when 
(j = 1. We considered two energy parametrizations, one 
at fixed native state's energy, and three target structures, 
with different geometries. We found that, when the na- 
tive state's energy varies with a, the native fold always 
exhibits the highest occupation probability, a measure of 
its stability, and that, with the exception of one struc- 
ture, the latter lies between 0.6 and 0.9 |35| . 

Our results show that LR interactions play a major 
role in determining the folding kinetics of 48-mer three- 
dimensional lattice polymers modelled by the Go poten- 
tial. Indeed, for three target structures, with different 
native geometries, we observed a sharp increase in the 
folding time when the relative contribution of the LR in- 
teractions to the native state's energy is decreased. How- 
ever, the kinetic response to a decrease in the relative 
contribution of the LR interactions is strongly depen- 
dent on the target geometry. In fact, we have observed a 
remarkable three-order of magnitude span in the folding 
time of Go polymers folding to one of the target struc- 
tures studied (Figure 0}. 

The existence of different kinetic responses as a func- 
tion of target geometry has a mechanistic interpretation. 
We have found that, for a given target geometry, a ge- 
ometric coupling exists between local and LR contacts. 
When this is the case, the establishment of LR contacts 
is forced by the (previous) formation of local contacts. 
The absence of this geometric coupling leads to kinetics 
that are sensitive to the interaction energy parameters; 
in this case establishment of the local contacts is not suf- 
ficient to promote the formation of the LR ones if they 
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FIG. 4: Dependence of the logarithmic folding time, log 10 (t), 
on the long-range interaction parameter, < a < 1 for three 
target geometries at fixed native state's energy. The frac- 
tion of LR contacts is 0.77 for target T3, while for targets 
Tl and T2 it is 0.48 and 0.42, respectively. For the three 
structures the folding time increases considerably faster when 
a decreases than when a increases away from the respective 
minima. The behaviour observed for T3 is trivial and results 
from its considerably high content in LR contacts. In the 
limit of o — 0, the structure is forced to fold with only 20 
per cent of its native interactions and this results in folding 
failure. The results obtained for the low- and intermediate- 
CO target structures, Tl and T2, are more interesting. The 
corresponding curves are qualitatively similar but a closer in- 
spection reveals an important difference, namely: for a < 0.5 
the dependence of the folding time on a is much stronger 
for the intermediate-CO structure, T2. Indeed, in this case 
one observes a remarkable three-order of magnitude disper- 
sion of folding times, ranging from log 10 (t TO m) = 5.76 ± 0.05 
(for a = 0.65) to log 10 (t max ) = 8.75 ± 0.05 (for a = 0.10), 
by contrast with Tl for which log 10 (tmin) = 5.50 ± 0.08 (for 
a = 0.70) and \og w (t max ) = 7.69 ± 0.09 (for a = 0.00) 



are strongly penalized energetically, resulting in longer 
folding times. 



VI. PROTEINLIKE COOPERATIVITY: A NEW 
CHALLENGE FOR LATTICE MODELS 



A well-established criterion for two-state thermody- 
namic cooperativity observed in folding experiments of 
real proteins is the calorimetric criterion introduced in 




FIG. 5: Calorimetric criterium: the distribution of enthalpy 
at the midpoint of a two-state folding transition is bimodal 
with very few molecules having enthalpies between the native 
(N) and denatured (D) states. 



the 1970s by Privalov and co-workers |4l|. The calori- 
metric criterion relates thermodynamic cooperativity to 
a bimodal distribution of the enthalpy, describing a con- 
formational population peaked around the native and the 
unfolded states and practically zero at intermediate en- 
thalpies, at the midpoint of the folding transition (Fig- 
ureEJ. More recently, it was suggested that a kinetic cri- 
terion known as the "chevron-plot" is the definitive hall- 
mark of two-state folding cooperativity 01 ■ Why? The 
"chevron-plot" is a V-shaped graph, that results from 
plotting the logarithm of k, the relaxation rate constant, 
as a function of the denaturant concentration (at con- 
stant temperature), where one observes that the folding 
rate constant, kf, dominates at low denaturant concen- 
tration while the unfolding rate constant, k u , dominates 
at high denaturant concentration. The chevron-plot was 
shown to be a more restrictive criterion for cooperative 
folding behaviour than the calorimetric criterion in the 
sense that it is not present in all of the two-state pro- 
teins that passed the calorimetric cooperative test. In 
other words, thermodynamic cooperativity is a neces- 
sary but not a sufficient condition for kinetic coopera- 
tivity (see Ref. |42( for a recent review on cooperativ- 
ity principles in protein folding). The folding of lattice 
polymers (Go and other s) appe ars to be relatively non- 
cooperative in this sense |42|,|43j (note however that, in a 
broader sense as pointed out in the previous section, dif- 
ferent degrees of cooperative behaviour may be observed 
in lattice models.). Based on these observations Kaya 
and Chan suggested that if proteinlike cooperativity 
is the main drive for geometry-dependent kinetics, that 
appears to be lacking in simple lattice models, it is not 
surprising that their folding rates are weakly correlated 
with contact order. The question is then what happens 
in these models if proteinlike cooperativity is enhanced 
? Kaya and Chan [SJ addressed this question in the 
context of a new Go-type interaction scheme based on 
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FIG. 6: A pictorial representation of a 'chevron-plot', a dis- 
tinctive feature of experimental two-state folding kinetics. 
The relaxation rate constant, k, is the sum of the folding 
and unfolding rates constants, k — kf + k u . The folding rate 
constant dominates at low denaturant concentration while the 
unfolding rate constant dominates at high denaturant concen- 
tration. 



the crucial assumption that a favourable correlation ex- 
ists between local conformational preferences and nonlo- 
cal interactions, i.e., nonlocal interactions are stabilised 
when the chain segments around the native residues are 
in their native conformations (the model is suggested by 
the experimental observation that secondary structural 
elements are stable only in the native fold |45|'). This 
correlation is most simply modelled as a (non-additive) 
many-body 'attraction' that promotes the folding of large 
portions of the native structure which in turn increases 
the energy gap between the native and unfolded confor- 
mations. Kaya and Chan have found that this gener- 
alized Go model exhibits thermodynamic cooperativity 
and linear chevron plots similar to those observed exper- 
imentally for real two-state folders. Moreover the model 
yields folding rates that are logarithmically well corre- 
lated (r = 0.94) with the contact order parameter. The 
way in which these many-body correlations arise in gen- 
eral (or are 'encoded' in the primary sequence) remains 
an open question. 



VII. CONCLUSIONS 

In the present work we have reviewed some results and 
concepts that emerged in the field of protein folding dur- 
ing the last few years, based on results obtained through 
Monte Carlo simulations of simple lattice models. 

There is experimental evidence that the folding of 
many small, single domain, two-state proteins occurs in 
smooth energy landscapes and that the folding kinetics 
is correlated to the geometry of the native state. In this 
context Go- and Go-type models are becoming increas- 



ingly popular since lattice-polymers modelled by the Go 
potential exhibit smooth energy landscapes where the 
effects of native geometry dominate. A recent finding 
obtained from simulations on these models is that the 
long-range contacts play a determinant role in the fold- 
ing kinetics and that this effect is geometry-dependent. 

However, recent results on the traditional Go poten- 
tial with additive pairwise interactions suggest that these 
models fail to capture the proteinlike cooperativity of real 
two-state folding kinetics and the question of wether real- 
istic folding may be addressed within the scope of simple 
lattice models becomes relevant. In particular, the idea 
that a geometry-dependent kinetics may result from the 
type of cooperativity exhibited by real two-state folders 
has proven a new challenge for lattice models. The devel- 
opment of new interaction schemes based on non-additive 
many-body interactions leading to Go-type models with 
protein-like cooperativity appears to be the next step to- 
wards a more realistic lattice modelling of two-state fold- 
ing kinetics. 

It is an honour and a great pleasure to contribute to 
this issue in honour of Ben Widom. Simple lattice models 
for complex fluids owe a lot to Ben's imagination and 
creativity. Thank you Ben for leading the way and Happy 
Birthday! 
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